Interpretable deep learning: interpretation, interpretability, trustworthiness, and beyond
نویسندگان
چکیده
Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to over-parameterized black-box nature, it is often difficult understand the prediction results deep models. In recent years, many interpretation tools proposed explain or reveal how models make decisions. this paper, we review line research try a comprehensive survey. Specifically, first introduce clarify two basic concepts—interpretations interpretability—that people usually get confused about. To address efforts in interpretations, elaborate designs number algorithms, from different perspectives, by proposing new taxonomy. Then, results, also survey performance metrics evaluating algorithms. Further, summarize current works models’ interpretability using “trustworthy” Finally, discuss connections between interpretations other factors, such as adversarial robustness several open-source libraries algorithms evaluation approaches.
منابع مشابه
Beyond Sparsity: Tree Regularization of Deep Models for Interpretability
The lack of interpretability remains a key barrier to the adoption of deep models in many applications. In this work, we explicitly regularize deep models so human users might step through the process behind their predictions in little time. Specifically, we train deep time-series models so their classprobability predictions have high accuracy while being closely modeled by decision trees with ...
متن کاملInterpNET: Neural Introspection for Interpretable Deep Learning
Humans are able to explain their reasoning. On the contrary, deep neural networks are not. This paper attempts to bridge this gap by introducing a new way to design interpretable neural networks for classification, inspired by physiological evidence of the human visual system’s inner-workings. This paper proposes a neural network design paradigm, termed InterpNET, which can be combined with any...
متن کاملInterpretable Deep Learning applied to Plant Stress Phenotyping
Availability of an explainable deep learning model that can be applied to practical real world scenarios and in turn, can consistently, rapidly and accurately identify specific and minute traits in applicable fields of biological sciences, is scarce. Here we consider one such real world example viz., accurate identification, classification and quantification of biotic and abiotic stresses in cr...
متن کاملA Deep Learning Interpretable Classifier for Diabetic Retinopathy Disease Grading
Deep neural network models have been proven to be very successful in image classification tasks, also for medical diagnosis, but their main concern is its lack of interpretability. They use to work as intuition machines with high statistical confidence but unable to give interpretable explanations about the reported results. The vast amount of parameters of these models make difficult to infer ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Knowledge and Information Systems
سال: 2022
ISSN: ['0219-3116', '0219-1377']
DOI: https://doi.org/10.1007/s10115-022-01756-8